Multi-Granularity Retrieval Model for Bridging Gaps between Biomedical Concepts and Entities: THUIR at TREC 2007 Genomics Track

نویسندگان

  • Hongning Wang
  • Jiao Li
  • Shilin Ding
  • Xiaoyan Zhu
چکیده

Abstract General concepts are always used to describe query requirement (In the example “What tumor types are associated with Rb1 mutations?”, “Tumor types” is a general concept, and its entity in a relevant documents can be “brain tumor”). To bridge the gaps between concepts in user queries and entities in relevant documents, we proposed a multi-granularity retrieval model in TREC 2007 Genomics task. The model consists of three components: (1) Paragraph retrieval is employed to retrieve candidate paragraph initially; (2) Dictionary-based NER is utilized to recognize named entities of given types; (3) Passage ranking is used to rank retrieved candidate passages. Our proposed model achieve promising result (Passage MAP=0.1023, with NER bottleneck eliminated).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IIT TREC 2007 Genomics Track: Using Concept-Based Semantics in Context for Genomics Literature Passage Retrieval

For the TREC-2007 Genomics Track [1], we explore unsupervised techniques for extracting semantic information about biomedical concepts with a retrieval model for using these semantics in context to improve passage retrieval precision. Dependency grammar analysis is evaluated for boosting the rank of passages where complementary subject/object concept pairs can be identified between queries and ...

متن کامل

Learning Domain-Specific Knowledge from Context--THUIR at TREC 2005 Genomics Track

We(Tsinghua University) participated both Ad Hoc Retrieval Task and Categorization Task in TREC2005 Genomics Track, in which we designed and implemented a serious of methods encompassed learning domain-specific knowledge from context. In Ad Hoc Retrieval Task, internal resource is introduced to expand query, different granularity indexing provides more flexible retrieval space, and pattern disc...

متن کامل

IIT TREC 2006: Genomics Track

For the TREC-2006 Genomics Track, we report on the effectiveness of composite information retrieval functions based on a dimensional data model for improving document, passage, and aspect search precision of genomics literature. We designed an approach, and developed a corresponding search engine, based on a novel dimensional data model capable of document, paragraph, sentence, and passage leve...

متن کامل

THUIR at TREC 2004: Genomics Track

This is the first time that THUIR participates in TREC Genomics Track. We took part in both Ad hoc retrieval task and Categorization task. Based on our retrieval system TMiner, our research in the Ad hoc retrieval task focuses on: (1) Category of organism retrieval strategy; (2) Primary Feature Model; (3) Query Expansion (QE) technology; (4) Result fusion method. Five official runs have been su...

متن کامل

DUTIR at TREC 2007 Genomics Track

This paper describes our experiments on TREC 2007 Genomics Track which is concerned with question answering extraction from full-text biomedical literatures. In our experiment, named entities were recognized at the preprocessing stage using a two-view method. MeSH was used to expand the terms. We performed passage retrieval by using sentence-level half overlapped sliding windows. Indri structur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007